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KAW SEQUENCE LISTING ^ATE ; 0 V14/2 002 

PATENT APPLICATION: US/10/066,007 TIME. 18.39.10 

Input set : N:\Crf3\RULE60\10066007.txt 
output set: N:\CRF3\02142002\J066007.raw 

3 <110:> APPLICANT: HOSHINO, Tatsuo 

4 OJIMA, Kazuyuki 

5 SETOGUCHI, Yutaka 

7 120^> TITLE OF INVENTION: ASTAXANTHIN SYNTHETASE 
9 --130- FILE REFERENCE: ASTAXANTHIN SYNTHETASE 
11 .140- CURRENT APPLICATION NUMBER: US/10/066,007 
C--> 12 <141> CURRENT FILING DATE: 2001-02-01 

14 -[ISOy PRIOR APPLICATION NUMBER: US/09/518,386 

15 ■'151'> PRIOR FILING DATE: 2000-03-03 

18 -'ISO'^ PRIOR APPLICATION NUMBER: EP 99104668.1 

19 ••151> PRIOR FILING DATE: 1999-03-09 

21 -;150> PRIOR APPLICATION NUMBER: EP 00101666.6 

22 ^:151> PRIOR FILING DATE: 2000-02-01 
24 •a60 > NUMBER OF SEQ ID NOS : 32 

26 -:170:> SOFTWARE: PatentIn Ver . 2.1 

28 •:210-> SEQ ID NO: 1 

29 *:211> LENGTH: 557 

30 -:::12:> TYPE: PRT 

31 ■:213> ORGANISM: Phaffia rhodozyma 
3 3 <220> FEATURE: 

34 <221> NAME/KEY: TRANSIT 

35 <222> LOCATION: (1)..(26) 

37 <400 > SEQUENCE: 1 , „ i o>,^ 

38 Met Phe lie Leu Val Leu Leu Thr Gly Ala Leu Gly Leu Ala Ala Phe 

.- TO J- ^ 



41 ser Trp Ala Ser He Ala Phe Phe Ser L^u Tyr Leu Ala Pro Arg Arg 

I] ser ser Leu T^r Asn Leu Gin Gly Pro Asn His Thr Asn Tyr Phe Thr 
ll Gly Asn Phe Leu Asp He Leu Ser Ala Arg Thr Gly Glu Glu His Ala 



50 Lys T^r Arg Glu Lys Tyr Gly Ser Thr Leu Arg Phe Ala Gly He Ala 
53 Gly Ala Pro Val Leu aIh Ser Thr Asp Pro Lys Val Phe Asn His Val 

56 Met Lys Glu Ala Tyr Asp Tyr Pro Lys Pro Gly Met Ala Ala Arg Val 

inR J-iU 

57 



100 



59 Leu Arg He Ala Thr Gly Asp Gly Val Val Thr Ala Glu Gly Glu Ala 
Lys ill His Arg Arg He Met He Pro Ser Leu Ser Ala Gin Ala 

65 val ill ser Met Val Pro He Phe Leu Glu Lys G^ly Met Glu Leu Val 

66 145 150 155 



62 His 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/066,007 



DATE: 02/14/2002 
TIME: 18:39:10 



Input Set : N:\Crf3\EULE60\10066007.txt 
output set: N:\CRF3\02142002\J066007.raw 

68 ASP Lys Met Met Glu Asp Ala Ala Glu Lys Asp Met Ala Val Gly Glu 

71 Ser Ala Gly Glu S Lys Ala Thr Arg Leu Glu Thr Glu Gly Val Asp 

74 val Lys ASP ^rp Val Gly Arg Ala ihr Leu Asp Val Met Ala Leu Ala 

77 Gly Phe i!p Tyr Lys Ser Asp Ser Leu Gin Asn Lys Thr Asn Glu Leu 

80 Tyr vfl Ala Phe Val Gly Leu Thr Asp Gly Phe Ala Pro Thr Leu Asp 

83 ser Phe Lys Ala He Me^ Trp Asp Phe Val Pro Tyr Phe Arg Thr Met 

24 5 ^ ->\J 

86 Lys Arg Arg His Glu He Pro Leu Thr Gin Gly Leu Ala Val Ser Arg 

89 Arg Val Gly lie Glu Leu Met Glu Gin Lys Lys Gin Ala Val Leu Gly 

92 ser Ala sir Asp Gin Ala Val Asp Lys Lys Asp Val Gin Gly Arg Asp 

95 lie Leu Ser Leu Leu Val ArJ Ala Asn He Ala Ala Asn Leu Pro Glu 

98 ser Gin Lys Leu Ser Asp Glu Glu Val Leu Ala Gin He Ser Asn Leu 

101 Leu Phe Ala Gly^Tyr Glu Thr Ser Ser Thr Val Leu Thr Trp Met Phe 

i°04 His Arg Leu Ser Glu Asp Lys Ala Val Gin Asp Lys Leu Arg Glu Glu 

i°07 lie cys Gin He Asp Thr Asp M^t Pro Thr Leu Asp Glu Leu Asn Ala 

no Leu pro Tyr Leu Glu Ala Phe Val Lys Glu Ser Leu Arg Leu Asp Pro 

H3 pro ser Pro Tyr Ala Isn Arg Glu Cys Leu Lys Asp Glu Asp Phe He 

lie pro Leu Ala Glu Pro Val He Gly Arg Lp Gly Ser Val lie Asn Glu 

lis val Arg He ^hr Lys Gly Thr Met tal Met Leu Pro Leu Phe Asn He 

^22 Asn Arg Ser Lys Phe He Tyr c'ly Glu Asp Ala Glu Glu Phe Arg Pro 

i's Glu Irg Trp Leu Glu Asp Val Thr Asp Ser Leu Asn Ser He Glu Ala 

ioe pro Tyr Gly His Gin lla Ser Phe He Ser Gly Pro Arg Ala Cys Phe 

in Gly Trp Arg Phe ll! Val Ala Glu Met Lys Ala Phe Leu Phe Val Thr 

^34 Leu Arg Arg VaJ Gin Phe Glu Pro Tie He Ser His Pro Glu Tyr Glu 

i37 His He Thr Leu He He Ser Arg Pro Arg He Val Gly Arg Glu Lys 

30 535 w>4U 

^40 Glu Gly Tyr Gin Met Arg Leu Gin Val Lys Pro Val Glu 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/IO/066 , 007 



DATE: 02/14/2002 
TIME: 18:39:10 



input set : N:\Crf3\RULE60\10066007.txt 
output set: N:\CRF3\02l42002\J066007.raw 



141 
144 • 
145 
146 
147 
149 
150 
151 
153 
154 
155 
157 
158 
159 
161 
162 
163 
164 
166 
167 
168 
170 
171 
172 
174 
17 5 
176 
178 
179 
180 
182 
183 
184 
186 
187 
188 
190 
191 
192 
194 
195 
196 
198 
199 
2 0 0 
202 
203 
204 
206 



545 
<.210-> 
< 211-^ 
•.212:> 
-..213> 
•.220-.> 
^.221-> 
<222'> 
<220^ 
..:221-^ 
'.:222^ 
•.:220- 
^221^ 
•222 
400 



550 

SEQ ID NO: 2 
LENGTH: 1932 
TYPE : DNA 

ORGANISM: Phaffia rhodozyma 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (33).. (1706) 
FEATURE : 

NAME/KEY: polyA_site 
LOCATION: (1871) 
FEATURE 1 
NAME/KEY: mRNA 
. LOCATION: (14).. (1891) 
SEQUENCE : 2 



555 



gaattcggca cgaggccacc tactttctcc at 



aca 
Thr 

ttc 

Phe 

ggc 
Gly 
40 
tea 
Ser 

age 
Ser 

acc 
Thr 

ccq 
Pro 

Gly 
12 0 
atg 
Met 



ggt get 

Gly Ala 
10 

agt ctt 
Ser Leu 



25 
ccg 



aat 



Pro Asn 

get cgt 
Ala Arg 

acc etc 
Thr Leu 



gat eeg 
Asp Pro 

90 
eet 
Pro 



tta ggc 
Leu Gly 

tae etc 
Tyr Leu 

eat aee 
His Thr 

aea ggt 
Thr Gly 
60 

egg ttt 
Arg Phe 

75 
aaa gtc 
Lys Val 



ctg get 
Leu Ala 

get eeg 
Ala Pro 
30 

aae tae 
Asn Tyr 

45 
gaa gag 
Glu Glu 



get 
Ala 
15 
agg 
Arg 

ttt 
Phe 

cat 
His 



get ggg ate 
Ala Gly He 



tt e 
Phe 



aaa 
Lys 
105 

gtt gtt 
Val Val 

ate ecc 
lie Pro 

tta gaa 
Leu Glu 



get gag aag 



ggt atg 
Gly Met 

acg gcg 
Thr Ala 

tet etg 
Ser Leu 
140 
aaa ggt 
Lys Gly 
155 

gat atg 



ttc aac cat 
Phe Asn His 
95 

gee get cga 
Ala Ala Arg 

110 

gaa ggt gaa 
Glu Gly Glu 
125 

tee get eag 
Ser Ala Gin 

atg gaa ctt 
Met Glu Leu 

gee gtg gga 



tte 
Phe 

ega 
Arg 

aca 
Thr 

gcg 
Ala 

get 
Ala 
80 
gtg 
Val 



tte ate 
Phe He 

tgg gca 
Trp Ala 

tea etg 
Ser Leu 

35 

aat ttt 
Asn Phe 

50 

aag tae aga 
Lys Tyr Arg 

65 
gga 
Gly 



atg 
Met 
1 

tea 
Ser 

tet 
Ser 

ggc 
Gly 



ttg gtc 
Leu Val 
5 

tee ata 
Ser He 

20 

tat aae 
Tyr Asn 

tta gae 
Leu Asp 

gaa aaa 
Glu Lys 



gca ccc gtc 
Ala Pro Val 



gtg 
Val 

get 
Ala 

gee 
Ala 



gtc 
Val 

160 
gag 



atg 
Met 

etc 
Leu 

eat 
His 

gtt 
Val 
145 
gae 
Asp 



aaa gaa gee 
Lys Glu Ala 
100 

aga att get 
Arg He Ala 

115 
aag ega eat 
Lys Arg His 
130 

aag teg atg 
Lys Ser Met 



ttg 
Leu 
85 
tae 
Tyr 

aee 
Thr 

ega 
Arg 

gtc 
Val 



ttg etc 
Leu Leu 

gcg ttc 
Ala Phe 

ctt eag 
Leu Gin 

ate etc 
lie Leu 
55 

tae gga 

Tyr Gly 
70 

aac teg 
Asn Ser 



53 



101 



149 



197 



245 



293 



teg 



aag atg atg gag 
Lys Met Met Glu 
165 

gee ggt gaa aag 



gae tat 
Asp Tyr 

gga gat 
Gly Asp 

agg ate 
Arg He 
135 
cca att 
Pro He 
150 

gat gcg 
Asp Ala 



341 



389 



437 



485 



533 



aag gca 581 
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207 

2 08 

210 

211 

212 

214 

215 

216 

218 

219 

220 
22 2 
223 
224 
226 
227 
228 
2 3 0 
231 
232 
234 
235 
2 36 
2 38 
2 39 
24 0 



Ala Glu Lys 

170 

acc aga etc 
Thr Arg Leu 

185 
get act ctg 
Ala Thr Leu 
200 

teg etc cag 
Ser Leu Gin 



RAW SEQUENCE LISTING 
PATENT APPLICATION: US/10/066,007 

input set : N:\Crf3\RULE60\10066007 txt 
output set: N:\CRF3\02142002\J066007.raw 

Asp Met Ala 



DATE: 02/14/2002 
TIME: 18:39:10 



gag acc 
Glu Thr 

gac gtc 
Asp Val 



aee gat ggg 
Thr Asp Gly 

gat ttt gta 
Asp Phe Val 
250 

ttg act caa 
Leu Thr Gin 

265 
gag caa aag 
Glu Gin Lys 
280 

gat aaa aag 
Asp Lys Lys 



aac 
Asn 

ttt 
Phe 
235 
cct 
Pro 



aag 
Lys 
220 

get 
Ala 

tac 
Tyr 



gaa 
Glu 

atg 
Met 
205 
aee 
Thr 

cct 
Pro 

tte 
Phe 



242 
243 
244 

246 
247 
248 
250 
251 
2 52 
254 
255 
256 
258 
259 
260 
262 
263 
264 
266 
267 
268 
270 
271 



gca aac ate 
Ala Asn He 



gag gta etc 
Glu Val Leu 

330 

tct teg aca 
Ser Ser Thr 
345 

gcc gtt cag 
Ala Val Gin 
360 

atg cct acg 
Met Pro Thr 

gtt aag gag 
Val Lys Glu 

gaa tgc tta 
Glu Cys Leu 
410 

ggt cga gat 
Gly Arg Asp 



gga tta 
Gly Leu 

aag cag 
Lys Gin 

gat gtt 
Asp Val 
300 
gcc gcc 
Ala Ala 
315 

get cag 
Ala Gin 



gea 
Ala 



gtc ttg 
Val Leu 



gcc 
Ala 
285 
caa 
Gin 

aac 
Asn 

ate 
He 

aca 
Thr 



Val Gly 
175 
gga gtc 
Gly Val 
190 

get ett 
Ala Leu 

aat gag 
Asn Glu 

acc ttg 
Thr Leu 

cga act 
Arg Thr 

255 
gtt tec 
Val Ser 

270 

gtg ett 
Val Leu 



Glu Ser 

gat gta 
Asp Val 

gca gga 
Ala Gly 



etc tat 
Leu Tyr 
225 
gac teg 
Asp Ser 
240 

atg aaa 
Met Lys 

cga cga 
Arg Arg 

ggc tea 
Gly Ser 



Ala Gly Glu 
180 

aag gat tgg 
Lys Asp Trp 
195 

ttt gac tat 
Phe Asp Tyr 
210 

gtc get ttt 
Val Ala Phe 



tte aag get 
Phe Lys Ala 



gat aaa 
Asp Lys 

eta gac 
Leu Asp 
380 
tct ett 
Ser Leu 
395 

aag gat 
Lys Asp 

ggg teg 
Gly Ser 



ett 
Leu 
365 
gaa 
Glu 

egt 
Arg 

gaa 
Glu 

gtc 
Val 



ggt egg 
Gly Arg 

ctg cct 
Leu Pro 

agt aac 
Ser Asn 
335 
tgg atg 
Trp Met 
350 

cga gaa 
Arg Glu 



gat ate 
Asp He 

305 
gaa tct 
Glu Ser 
320 

ctg tta 
Leu Leu 

ttt cac 
Phe His 

gaa att 
Glu He 



egg aga cat 
Arg Arg His 
260 

gtt ggg ate 
Val Gly He 

275 
get tec gat 
Ala Ser Asp 
290 

eta agt etc 
Leu Ser Leu 

caa aag ctg 
Gin Lys Leu 



Lys Lys Ala 

gtc ggt cga 
Val Gly Arg 

aag age gac 
Lys ser Asp 

215 

gtc gga ett 
Val Gly Leu 

230 

ate atg tgg 
He Met Trp 
245 

gag ata cct 
Glu He Pro 



gag ett atg 
Glu Leu Met 



629 



677 



725 



773 



821 



869 



ett aat 
Leu Asn 

eta gac 
Leu Asp 

gac tte 
Asp Phe 
415 
ate aac 
He Asn 



ttt get 
Phe Ala 

cga etc 
Arg Leu 

355 
tgt cag 
cys Gin 
370 

cct tat 



eag get 
Gin Ala 

eta gtg 
Leu Val 
310 
tec gat 
Ser Asp 
325 

gga tat gaa 
Gly Tyr Glu 
340 
tea 
Ser 



gtt 
Val 
295 
aga 
Arg 

gag 

Glu 

act 
Thr 



gaa gac aaa 
Glu Asp Lys 



gcg ttg 
Ala Leu Pro Tyr 

385 
cct cct 
Pro Pro 
400 

ate cea 
He Pro 



ate 
He 

etc 
Leu 



917 



965 



1013 



1061 



1109 



agt ecg tat 
Ser Pro Tyr 



gag gtc 
Glu Val 



ett gcc gag 
Leu Ala Glu 
420 
egg ate acg 
Arg He Thr 



gac acg gat 
Asp Thr Asp 
375 

gaa gcg ttt 
Glu Ala Phe 
390 

get aac cgt 
Ala Asn Arg 
405 

cct gtc att 
Pro Val He 



aaa gga acg 
Lys Gly Thr 



1157 



1205 



1253 



1301 



1349 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/066,007 



DATE: 02/14/2002 
TIME: 18:39:10 



272 
274 
275 
276 
278 
279 
280 



283 

284 

286 

287 

288 

2 90 

291 

292 

294 

295 

296 

298 

299 
300 
302 
303 
304 
306 
308 
310 
312 
315 
316 
317 
318 
320 
321 
322 
323 
324 
325 
326 
327 
328 
329 
330 
331 
332 
333 
334 
335 



input set : N;\Crf3\RULE60\10066007.txt 
output Set: N:\CRF3\02142002\J066007.raw 

4^5 430 435 

.tn ate atQ ctt ccg ttg ttc aac ate aat cgt tea aag ttc att tat 
Me? Va^ Me? Leu Pro Leu Phe Asn He Asn Arg Ser Lys Phe He Tyr 
440 445 450 

III aaa qat gca gaa gaa ttc aga ccg gag agg tgg ctt gag gac gta 
ITy ITu ASP Ala Glu Glu Phe Arg Pro Glu Arg Trp Leu Glu Asp Val 

ara aac tcq ctc a!c agt att gaa gca ccc tat gga cac cag gcg age 
Thr Zl ser Leu Asn Ser He Glu Ala Pro Tyr Gly His Gin Ala Ser 

ttt ate tct gga ccc aga get tge ttt ggt tgg cga ttt get gte gee 
Phe lie ser Gly Pro Arg Ala Cys Phe Gly Trp Arg Phe Ala Val Ala 

490 495 500 

rr.cr ^fd aacT qcc ttc ttg ttt gte act etc cgt egg gtc cag ttc gag 
HI Me? Lys Ai: Phe Leu Phe Val Thr Leu Arg Arg Val Gin Phe Glu 

505 510 
ccc ate ate tct cat cca gag tac gag cac ate acc ttg ate att tec 
pro lie lie Ser His Pro Glu Tyr Glu His He Thr Leu He He Ser 

525 

cat cct cqa ate gtt ggt aga gag aag gag ggg tac cag atg cgt ttg 
III pro Arg He Val Gly Arg Glu Lys Glu Gly Tyr Gin Met Arg Leu 

540 545 
eag gte aag ccg gte gaa tga gttgattctt catatgttaa gagaagttct 
Gin Val Lys Pro Val Glu 

a.atct,„a at^t^t^act a,,ac«t,c =„.t»,„ to,,„t,« tctc.taccc 
lllltllZ HH'T.H timillll ;ia;:i;a,. ;.aa„c„c c,=.=,a.cc 

ggctcgtgce gaatte 
<210> SEQ ID NO: 3 
<211> LENGTH: 557 
<212:> TYPE: PRT 

."^13-> ORGANISM: Phaffia rhodozyma 

^r-phf nf val L.U L.U Thr =1V Leu Ol, L.U .1, «a Ph. 

Sel Trp Ala Ser 11^ Ala Phe Phe Ser Leu Tyr Leu Ala Pro Arg Arg 

20 25 
ser ser Leu Tyr Asn Leu Gin Gly Pro Asn His Thr Asn Tyr Phe Thr 

Gly Asn Phe Leu Asp He Leu sll Ala Arg Thr Gly Glu Glu His Ala 

Lys T^r Arg Glu Lys Tyr GlJ Ser Thr Leu Arg Phe Ala Gly He Ala 

Gly Ala Pro Val Leu A^sn Ser Thr Asp Pro Lys Val Phe Asn His Val 

Met Lys Glu Ala T^r Asp Tyr Pro Lys Pro Gly Met Ala Ala Arg Val 

Leu Arg He Ma Thr Gly Asp Gly Val Val Thr Ala Glu Gly Glu Ala 



1397 



1445 



1493 



1541 



1589 



1637 



1685 



1736 



1796 
1856 
1916 
1932 
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DATE: 02/14/2002 
TIME: 18:39:11 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/066,007 

Input Set : N:\Crf3\RULE60\10066007.txt 
Output Set: N:\CRF3\02142002\J066007.raw 

L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
(46) "n" or "Xaa" used, for SEQ ID#:16 
(46) "n" or "Xaa" used, for SEQ ID#:17 
(46) "n" or "Xaa" used, for SEQ ID#:26 
(46) "n' 



L: 727 M: 341 W 
L:770 M:341 W 
L: 909 M: 341 W 
L:942 M: 341 W 



or "Xaa" used, for SEQ ID#:27 
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